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AN APPLICATION OF CODING THEORY 
TO ESTIMATING DAVENPORT CONSTANTS 

ALAIN PLAGNE AND WOLFGANG A. SCHMID 



Abstract. We investigate a certain well-established generalization of the Dav- 
enport constant. For j a positive integer (the case j = 1, is the classical one) 
and a finite Abelian group {G,+,0), the invariant Dj{G) is defined as the 
smallest £ such that each sequence over G of length at least I has j disjoint 
non-empty zero-sum subsequences. We investigate these quantities for elemen- 
tary 2-groups of large rank (relative to j). Using tools from coding theory, we 
give fairly precise estimates for these quantities. We use our results to give 
improved bounds for the classical Davenport constant of certain groups. 



1. Introduction 



For a given Abelian group {G,+,0), the Davenport constant, denoted D{G), is 
defined as the smallest integer i such that each sequence over G of length at least £ 
has a non-empty zero-sum subsequence, i.e. the sum of its terms is 0. Equivalently, 
^ ' D(G) is the maximal length of a minimal zero-sum sequence over G, i.e. the 

0^ ! maximal length of a sequence of elements of G summing to and with no proper 

J£^ ' subsequence summing to 0. 

It is considered as a central object in combinatorial number theory since Dav- 
enport popularized it in the 60's (as reported in [25]), notably for its link with 
C-^ I algebraic number theory, see e.g. [I7] or [16]. In fact it seems that the first paper 

that deals with this invariant was written by Rogers 28], who himself attributes 
the paternity of the problem to Sudler. 

This invariant has become the prototype of algebraic invariants of combinatorial 
flavour. Since the 60's, the theory of these invariants has highly developed in several 
j^ ' directions; see for instance the survey article [15] or Chapters 5, 6, and 7 of |17l . 

H I Let G be written, as is always possible, as a direct sum of cyclic groups G = 

Cm ® • • • ® C„^ with integers 1 < ni | • • • | n^ (r denotes the rank of G, and Ur the 
exponent, except for r — where the exponent is 1). Then, the basic lower bound 
for Davenport constant is 

r 

(1) D(G)>l + ^(n.-l); 

to see this, note that a sequence containing only, for each cyclic component G„. 
(1 < z < n), one generating element Ui — 1 times, has no non-empty zero-sum 
subsequence. 

It is known that for groups of rank at most two and for p-groups {p, a prime) , 
inequality ([T]) is in fact an equality; this was obtained independently in [llj and 
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For groups of rank at least four, equality is definitely not the rule (see 
[H [TTJ [E]). In the case of groups of rank three, it is conjectured that equality 
always holds but this conjecture is wide open (see jlSI). 

Concerning upper bounds, the best general result is the following 

D(G) <exp(G)fl + log '^' 



exp(G) 

proved in [T2l[23] . 

In view of the depicted situation, it appears that it is generically very difficult to 
determine the Davenport constant of a group (of rank at least three). In particular, 
despite various works related to the Davenport constant over the years, its actual 
value was only determined for a few additional - beyond the ones known since the 
end of the 60's - families of groups; see |3] for a recent contribution. 

The j-wise Davenport constants are defined depending on a positive integer j. 
We define Dj{G) to be the smallest £ such that each sequence over G of length at 
least i has j disjoint non-empty zero-sum subsequences. Equivalently, the maximal 
length of a zero-sum sequence over G that cannot be decomposed into j + 1 non- 
empty zero-sum sequences. Evidently Di(G) = D(G) and for any positive j one 
has Dj{G) <jD{G). 

This variant of the Davenport constant was introduced by Halter-Koch [19j in 
order to determine the order of magnitude of the counting function of the set of 
algebraic integers of some number field that are not divisible by a product oi j + 1 
irreducible algebraic integers of this number field. It is 

^ '■ ■ ^D,(G)-l 



logx 



(log log a;) 



where G denotes the ideal class group of the number field considered. Moreover, 
knowledge of these invariants is useful in investigations of the Davenport constant 
itself (cf. below for details and recall e.g. that a considerable part of [3], determining 
D(C| ® C^n), is devoted to determining Dj{C^) and closely related problems). 

The case of cyclic groups is the simplest one since then it is easy to see that 
jn — 1 < Dj{Cn) < jD(G„) = jn (for the lower bound, simply take a generating 
element repeated that number of times). The case of groups of rank two is also 
known [TH], as well as the case of certain closely related groups [T71 Section 6.1]. 
But in general computing (even bounding) Dj(G) is quite more complicated than 
for D(G), in particular for (elementary) p-groups. For example, Dj(G) for all j, is 
only known for the following elementary p-groups of rank greater than two: G|, 
G|, G|, and Cl (see [5J[I1[3]). 

The main difference is that it seems that the types of arguments used to de- 
termine D(G) for p-groups cannot be applied. At first this might seem surprising, 
however recall that the same phenomenon is encountered for the invariant r]{G) 
(cf. Section [5]) and other closely related invariants such as the Erdos-Ginzburg-Ziv 
constant. In particular, the difficulty of the problem seems to increase with the 
rank of the groups considered (to be precise, the key quantity is the size of the rank 
relative to the exponent), as then D(G) and ?7(G) are far apart (cf. Section [5] for 
the relevance of this fact). 

In this paper and from now on, we focus on the case of elementary 2-groups with 
large rank. The reason for this is two-fold. On the one hand, it is an interesting 
case; the rank is 'maximal' relative to the exponent. On the other hand, the special 
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nature of the group allows certain arguments that fail in more general situations; 
for instance recall that to show ri{C2) — 2'' is almost trivial, yet the problem of 
determining rj{C'L) for any (fixed) odd prime p and arbitrary r is wide open (it is 
even open, which order of magnitude is to be expected). 

While it is easy to determine that Di(C2) = r + 1 - we effectively consider a 
vector space over a field with a unique non-zero element, and having no non-empty 
zero-sum subsequences is thus equivalent to linear independence - the situation 
becomes more complicated for larger values of j. We observe that 

r < Di(C2^) < D,{Cl) < jD,{C^,) < j{r + 1). 

Thus, for a fixed positive integer j, the sequence {Dj{C2))refi has to grow linearly. 
Our main aim here is to make this statement more precise and to study the following 
relevant quantity 

r 
of which we examine the asymptotic behaviour when r becomes large. The quan- 
tities 

DJC^) D,(C^) 

a, = liminf ■' and (3. = limsup ■' 

r^+oc r r^+OD r 

will be considered and investigated. By the above crude reasoning we only get that 
1 ^ Q^j ^ i^j ^ j- Our first theorem, which gives explicit estimates for small values 
of J, improves on this. It is a generalization of results by Komlos (lower bound, 
quoted in PH [S]) and by Katona and Srivastava (upper bound) ,20, for the case 
j — 2 that we recall for the sake of completeness in the statement of the theorem. 



Theorem 1. For each sufficiently large integi 


;r r 


we have 


1.261 r 


< D2(C^) 


< 


1.396 r, 


1.500 r 


< D3(C20 


< 


1.771 r, 


1.723 r 


< D,{C^2) 


< 


2.131 r, 


1.934 r 


< D,{C^2) 


< 


2.478 r, 


2.137 r 


< D6(C^) 


< 


2.815 r, 


2.333 r 


< D7(C^) 


< 


3.143 r, 


2.523 r 


< D8(C^') 


< 


3.464 r. 


2.709 r 


< Dg{C^) 


< 


3.778 r, 


2.890 r 


< D,o{CE) 


< 


4.087 r. 



As will be apparent from our arguments, having our method at hand, to expand 
this list further is merely a computational effort. 

Notice that the question whether aj — /3j (independently of the value that this 
constant would take) seems not obvious. 

After the study of small values for j, we turn our attention to the case of large j's. 
Although we are unable to prove that aj and /3j are equal, which seems conceivable, 
we show that they at least grow at the same speed and more precisely determine 
their order of magnitude. 

Theorem 2. When j tends to infinity, we have the following: 

log 2 f_^) <hminfMil<iimsup^^^<21og2 ^^ 



log J/ "" '•^+00 r r^+00 r "" y^ogj 
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We believe that the bound for the hmmf is cfoser to the actual value. A heuristic 
suggests that the limsup in Theorem [2] is in fact close to log 2 (j/logj) as well. 
More precisely, we formulate the following conjecture. 

Conjecture 3. For any positive integer j, the limit 

7j = hm ■' ^ 

r— >+oo r 

exists and one has 

7, ~ log 2 (-^ 

as j tends to infinity. 

For results in the converse scenario, that is fixed but arbitrary r, and j goes to 
infinity, see [l4] . 

One of the main reasons for studying j'-wise Davenport constants is the fact 
that they are important in obtaining results on the Davenport constant itself. The 
connection is encoded in the following inequality, due to Delorme, Ordaz, and 
Quiroz 8 . For a finite Abelian group G and a subgroup H one has 

(2) D{G)<DoiH){G/H). 

Among others, this inequality encodes the classical form of the inductive method, 
originally introduced to determine the Davenport constant for groups of rank two 
(see [liinilSS]). 

We can apply our results on the j'-wise Davenport constants to obtain improved 
bounds on the Davenport constant for certain types of groups (the 2-rank has to 
be 'large' relative to the order). We only formulate it explicitly for a quite special 
type of group, which however, due to its extremal nature, is of relevance in this 
context. 

Corollary 4. When n tends to infinity, we have 

hm sup < 2 log 2 



>+oo r "" logn' 

For comparison, the general bound mentioned above yields only 

hm sup ^ 2 log 2 n. 

r— >-+oo r 

We immediately give the short proof of this result. 

Proof. By ©, we get that 

D(Cr' ® C2n) < Dd(c„)(C^2) = D„(CD. 
By Theorem [21 the claim follows. D 

We finish this Introduction with outlining the plan of the present article. In 
Section [5J we explain the methods and the prerequisites we need in the course 
of this article. In Section |3l we derive the lower bounds of our two results while 
in Section |4] the upper bounds are proved. Finally, in Section [Sj we discuss the 
heuristic leading to Conjecture [3l If true, this heuristic would establish at least the 
asymptotic equivalence of aj and /3j and imply the second part of Conjecture |3l 
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2. The methods 
We outline the methods we use to estabhsh our resuhs. 

2.1. Zero-sum subsequences of bounded length. Let G be an Abehan group. 
For X a real number, let s<x{G) denote the smallest element £ € N U {+00} such 
that each sequence of length at least £ has a zero-sum subsequence of length at 
most X. Evidently, s<a;(G) = s<l2,j(G), yet for technical reasons it is useful to 
define s<xiG) for non- integral x as well. 

A prominent special case of this definition is x — exp(G'), the resulting invariant 
is typically denoted by 77(G); note that for x < exp(G), one has s<2;(G) = +00. 
Also, note that for x > D(G) we have s<x{G) — D(G). 

For results on 5<x{G), for generic x, mainly for elementary 2- and 3-groups, see 
e.g. [SI 131 J and [^ for a very closely related problem (cf. below). For recent results 
on 7y(G), focusing on lower bounds, see [T0ll9]. 

To determine 77(G) seems to be a very difficult problem in general. For example, 
ri{C^) was determined only recently [27], despite the fact that the problem of deter- 
mining ?/(GJ) is fairly popular (see [lOj for a detailed outline of several problems, 
and their respective history, that are equivalent to determining r]{G^)). 

Yet, in the case of elementary 2-groups, to determine both 77(G2) and D(G2) 
is almost trivial. However, for other values of x even for elementary 2-groups the 
problem of determining s<3;(G2) is not at all trivial, namely, it is equivalent to a 
central problem of coding theory (cf. below) . 

It is known, in particular by the work of Delorme, Ordaz, and Quiroz [5], that the 
invariants 5<x{G) can be used to derive upper bounds for Dj{G). More specifically, 
we have (this is Lemma 2.4 in [14j ) 

(3) Dj+i(G) < minmax{Dj(G) +i,s<,(G) - 1}. 

Thus, knowing Di(G) — D(G) and the constants s<i(G), or bounds for these con- 
stants, one can obtain, recursively applying estimate ([3]), bounds for Dj(G). Notice 
however that even exact knowledge of D(G) and s<i(G) for all i can be insufficient 
to determine Dj{G) exactly via this method, which in general is not optimal. 

2.2. Coding theory enters the picture. We recall that the link between coding 
theory and combinatorial number theory is not new. For instance, Cohen, Litsyn, 
and Zemor ^ used coding theoretic bounds in the Sidon problem. The general 
paper [7] by two of these authors provides a worthwhile introduction to the links 
between the two problematics. Also, Freeze [13] used coding theory in the present 
context. 

One of the reasons for this connection is the following folkloric lemma [21] . 

Lemma 5. The minimal distance of a binary linear code C is equal to the minimal 
length of a zero-sum subsequence of columns of a parity check matrix of C . 

In the case of present interest, Cohen and Zemor [7] pointed out a connection 
between s<i(G2) and coding theory. As the situation at hand is slightly different 
from the one in that paper, and this connection is central to our investigations, we 
recall and slightly expand it in some detail. 

First, we give a technically useful definition. In the present context, we call 
a function / : [0, 1] — ?► [0, 1] upper-bounding if it is decreasing (not necessarily 
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strictly), continuous and each [n, k,d] code satisfies 

n \n 

In other words, upper-bounding functions are the functions intervening in the upper 
bounds of the rate of a code by a function of its normalized minimal distance. 

We call a function asymptotically upper-bounding if it has the same properties as 
an upper-bounding function, except that the inequality only has to hold for [n, fc, d] 
codes with sufficiently large n. 

Notice that in both cases, the assumptions on decreasingness and continuity are 
not restrictive at all since these assumptions are usually fulfilled. 

Lemma 6. Let f be an upper-bounding function. Let d, n, and r be three positive 
integers satisfying 2 < d < n — 1 and 

n — r f d-\-l 
>/ 



then 

s<d{Cl) < n. 

Moreover, the same assertion holds true for f an asymptotically upper-bounding 
function if we impose that n is sufficiently large (depending on f ). 

Proof. Let S = gi, . . . ,gn be an arbitrary finite sequence over C2 . We shall prove 
that it contains a zero-sum subsequence of length at most d. 

It is immediate that S has a zero-sum subsequence of length 1 if and only if 
occurs in S, and that S has a zero-sum subsequence of length 2 if and only if some 
element occurs at least twice in S. Thus, since d > 2, we may assume that S does 
neither contain nor an element more than once i.e. we effectively have to study 
the case of subsets of C2 \ {0} (and not the one of general sequences in C2). 

We assert that we may assume that the elements appearing in S generate Cj . To 
see this, note that if g is an element of C2 not contained in the subgroup generated 
by the elements of S, and if T denotes the sequence obtained by appending g to S, 
then each zero-sum subsequence of T is in fact a zero-sum subsequence of S. Thus, 
if S has no zero-sum subsequence of length at most d and the elements of S do not 
generate C2, then the longer sequence T, defined as above, neither has a zero-sum 
subsequence of length at most d. So, it suffices to establish an upper bound on the 
length of sequences S such that the elements appearing in S generate Cj . 

We choose some basis of CJ. We consider the binary linear code C C C^ of 
length n whose parity check matrix is ^ = [(/i | • • • | gn] € A^r.n (identify the gi's 
with their coordinate vectors with respect to some basis of Cj, and consider them 
as column vectors, and use the just chosen basis of C^). Notice that the rank of 
this matrix is equal to r in view of our assumption that the g^'s generate C2 . Let 
TO be the minimal distance of C By definition, the code C is an [n, n^ r,m] binary 
linear code. But by assumption since / is upper-bounding and 

n — r „ / d -\- 1 
>/ 



an [n,n — r,d + 1] code cannot exist. This implies that m < d + 1, or equivalently 

d > TO.. 



CODING THEORY AND DAVENPORT CONSTANTS 7 

We conclude by applying Lemma [5] which shows that S possesses a zero-sum 
subsequence of length m. 

The additional claim for asymptotically upper-bounding functions, is immediate 
in view of the just given argument. D 

3. Lower bounds 

In this section, we establish the lower bounds for Dj{C2), for large r, contained 
in Theorems [1] and [H Specifically, we prove the following asymptotic lower bound 
in r, which immediately yields both lower bounds (replacing log(j -I- 1) by logj is 
asymptotically, in j, irrelevant). 

Proposition 7. Let j be a positive integer. Then 

as r tends to infinity. 

Notice that the case j = 1 is essentially trivial, while the case j ~ 2 of this result, 
formulated in the context of coding theory, is attributed to Komlos in ^ and [21] . 
Indeed, our proof will generalize Komlos' approach (in the form given in [5]). This 
proof can be seen as probabilistic, yet we prefer to present it via a direct counting 
argument. Notice that it is non-constructive. 

We need the following well-known lemma (see e.g. [2T|). 

Lemma 8. Let n and k be two positive integers, n > k. In an n-dimensional vector 
space over a field with 2 elements, the number of k- dimensional subspaces is equal 
to the 2-ary binomial coefficient defined as 

n] _ (2" - 1) • • • (2"-'=^+i - 1) 
k\ ^ (2'=-!) •••(2-1) ' 

Moreover, the number of k -dimensional subspaces containing a fixed j- dimensional 
subspace, k > j , is equal to 

'n - f 

k - j 

We can now prove Proposition [T) 

Proof of Proposition [^ For the entire proof, we fix an arbitrary positive integer 
j > 1. As Di(C2) = D(C2) = r -|- 1, we can ignore the case j = 1. 

We shall now prove that for each integer n larger than or equal to r + j (this 
condition is technically convenient later on) and less than (jlog2/log(j -I- 1)) r 
(notice that, for r large enough, since j > 2, such n's always exist), one can find a 
sequence of cardinality n which does not contain j disjoint zero-sum subsequences. 
This will prove our result. 

To each sequence S = gi, . . . ,gn over Cj , with n > r + j, having the properties 
that S does neither contain nor an element at least twice, we associate, as de- 
scribed above, an [n,n — r] code (contained in C^, and we fix some basis). This 
linear code, automatically, has a minimal distance of at least 3. Conversely, any 
[n, n — r,d] code with d > 3 can be obtained in this way (cf. |7J). 

The following remark is central: the condition that S has j disjoint zero-sum 
subsequences translates to the condition that the associated code contains j non- 
zero codewords ci, . . . ,Cj such that intersection of the support (the set of indices 
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of non-zero coordinates) of Cu and Cv is empty for all distinct u, w G {1, . . . , j}. A 
code having this property will be called j -inadmissible, otherwise it will be called 
j -admissible. 

We first produce an upper bound on the total number of [n, n — r] codes that 
are j-inadmissible. 

By definition any j-inadmissible code contains Ci , . . . , Cj with the above men- 
tioned property. These c^'s generate a j-dimensional vector space since the c^'s 
are certainly independent: the non-zero coordinates of each Ci are unique to that 
element. 

Let V denote the set of all subsets {di, . . . ,dj} C C2 \ {0} such that the inter- 
section of the support of d„ and dy is empty for all distinct m, w S {1, . . . , j}; thus, 
in particular, all the di's are distinct. 

We note that a code C is j-inadmissible if and only if V^ C C for some V E V (this 
V is not necessarily unique). Moreover Lemma [5] implies that for each V £ V there 
are [^"^-i] codes containing V; note that if ^ C C then C also contains the vector 
space generated by V, which is j-dimensional, and apply Lemma [5] It follows that 
the total number of j-inadmissible codes cannot exceed 



IVI 



n-r - j 



In order to estimate |V| we make the following remark: each element of {1, . . . , n} 
has to belong to either the support of exactly one of the di 's or to none (obviously, 
the information which element of {I, . . . ,n} belongs to each of the di's uniquely 
determines the element of V). Thus, for each element of {1, . . . ,n} there are (at 
most) j + 1 possibilities. This readily gives |V| < (j -I- 1)". (We ignore the slight 
improvements that could be obtained from the fact that the ordering of the di's is 
irrelevant and the supports are non-empty, as they would not affect our estimate). 

We therefore infer that the total number of j-inadmissible [n, n ~ r] codes is 
bounded above by 



(j + ir 



Again by Lemma[8l it follows that the ratio of the total number of j-inadmissible 
,n — r] codes divided by the total number of [n, n — r] codes is bounded above by 



^' ir = (^■+^)" n V^ 

In-ri k=n-j+l 



< O' + ir n 



'^^ i^k — r 



2^ 

k—7i—j-\-l 

= (7 + 1)"2~''-' = 2"^°^^^^^^^"^-' 

Here, logj refers to the logarithm in basis 2. 

Thus, it follows that as soon as (nlog2(j + 1) — rj) is negative, that is 

n 7 

- < 



r log2(j-hl)' 
the existence of at least one admissible code is guaranteed. 
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From this we deduce (the condition n> r + j becomes irrelevant) 

as r tends to infinity. D 

4. Upper bounds 

4.1. The crucial lemma. The following lemma is central for our investigations. 

Lemma 9. Let f be an asymptotic upper-bounding function. Let j be a positive 
integer and p be a real number such that one has Dj{C2) < pr for each sufficiently 
large r. Let finally c denote a solution to the inequality 

P + c-2^^ 

p + c \p + c^ 

Then for each sufficiently large integer r, we have 

D,+i(C2^)<(p + c)r. 

Proof. By assumption, we have that 

{p + c)r -r _p + c-l / c \ _ / er \ / cr + 1 



{p + c)r p + c \p + cj \{p + c)r J \{p + c)r ^ 

f being decreasing. If we substitute \_{p + c)r\ for n and \_cr\ for d, we obtain (for 
r sufficiently large and by the continuity of /) 

n — r /d + 1 

>/ 

n \ 1^ 

an equation of the type given in Lemma [5] and we may therefore deduce that 

s<[crj (Ca) <n=[{p + c)r\ < {p + c)r. 
This now implies, by ([3]) and using our assumption, that 

D,+i(C2) < minmaxjpr + i,s<i(C2) — 1} 

< max{pr + \cr\ , s< Lc,.j (C2 ) - 1 } 

< max{(p + c)r, [p + c)r — 1} = (p + c)r, 

as wanted. D 

4.2. The upper bounds in Theorem[ll To obtain a proof of these upper bounds, 
we use the approach described in Section [2] in combination with a bound on the 
parameters of linear codes originally due to McEliece, Rodemich, Rumsey, and 
Welch 22] I that we recall here (see e.g. [21]): Let us define h to be the binary 
entropy function, that is (for < u < 1), 

hiu) — — iilog2 u — {\ — u) log2(l — u) 



and g{u) — h{{l — \/l — u)/2). Then the function / equal to 

(5) = [mino<n<i-2d- (l + g{u^) - g{u^ + 2<5m + 2^)) if ^ < 1/2 
1 otherwise 

is an asymptotically upper-bounding function and we may thus apply Lemma [9l 
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We define a sequence (uj)jeN recursively. We set ui = 1 and, for j > 1, let Uj+i 
be defined as the solution (if it exists, which is always the case in practice, this 
solution has to be unique) to the equation 

1 _;/ ".+1 



Uj + Uj+i \ Uj + Uj+i 

where we define Uj — ui + ■ ■ ■ + Uj, the sum of the j first values of the sequence 
{uj)jeN- The sequence (C/j)jgN corresponds to the coefhcient in the upper bound 
of Theorem [U (C/i = 1, t/g = 1.395 . . . , t/3 = 1.770 ...,...), which therefore follows 
by a repeated application of Lemma [5] 

We point out that for our problem it is actually useful to use this bound, as 
opposed to bounds whose numerical evaluation is simpler, since for our problem 
we encounter ^ in a fairly wide range. A simpler strategy, regarding computations, 
which is - in view of classical results on these bounds - obviously worse, though 
only slightly so, would be to use another bound proved in [22] (see also [21]), that 
is the function 

^^^^^^ih{l/2-^6iT^) if -5 < 1/2 

[^ otherwise 

for the first few values, namely 2,3,4 (for 2 this yields the identical bound, yet a 
slightly weaker one for 3, 4) where S is still fairly large, and then to switch to using 
a bound that is better for small S such as the Elias-Bassalygo bound (see [5] and 
pi]). that is 



/3(^) = J ^ " '^ ((1 " ^^^^^)/2) if ^< 1/2 
1 otherwise 

with h as above (this is the case w = of (|4.2p ). Using this approach, we would for 
instance get the values 1.776 for j = 3, 2.147 for j = 4, 2.512 for j — 5 and 4.172 
for j = 10 that is, slightly but noticeably weaker bounds. 

4.3. The upper bound in Theorem [2j For the asymptotics we use the method 
described in Section[2|in combination with the Hamming bound, that is the function 
/4 defined by 

/4(^)-i-; ^^^ 



We use this bound as the resulting analytic expressions and asymptotic calcula- 
tions are simpler than for stronger bounds. On the other hand, asymptotically, 
using say Elias-Bassalygo would not yield a better result. Interestingly, for this 
particular choice of upper-bounding function, a very elementary proof (cf. below) 
of the conclusion of Lemma IHl can be obtained with additive means, avoiding the 
coding theoretic argument that this function is an upper-bounding function. This 
makes our proof of Theorem [2] essentially self-contained. 

Proof. We start with a sequence 5' = 51, . . . , 5„ in C2 , where n = [{p + c)rj . The 
set {1, . . . ,n} has 

^''^W[iP + c)r\\ f[{p + c)r\ 
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subsets of size at most cr/2. Since by assumption 

and since, as r tends to infinity, (^^) ~ c 2" KP/o'V (for some constant c depending 
on a and /3), it follows that, for sufficiently large r, the number of such sets exceeds 
2''. This implies that there exist two distinct subsets I, J of {1, . . . , n} such that 

This yields 

E 9. = 0, 
ieiAj 

- where lAJ denotes the symmetric difference of I and J -, that is a non-empty 
zero-sum sequence of length at most cr. D 

Using the upper-bounding function /4, we build a sequence, let us call it (uj)jgN 
such that wi = 1 and 

(4) 1 = h ( ^^ 

Vj+v,+^ \2{Vj+v,+i), 

where Vj — vi + ■ ■ ■ +Vj for any integer j > 1. It can be checked easily that such 
a sequence is well-defined and that for any j > 1, one has Vj < 1. Then, rewriting 
(HI) as 



we may repeatedly apply Lemma |9] and obtain the inequality 

D,(CD < V, r 

for each integer j and all sufficiently large r (relative to j). 

By the lower bounds in Theorem [2] proved in the preceding Section, we already 
know that Vj tends to infinity when j tends to infinity since 

r logj 

as r tends to infinity ; while Vj remains bounded by 1. 

We can now develop (|3]) to obtain the desired asymptotics. At the first order 
when j tends to infinity we obtain 

— ^ — "^-^^ iogJ ...."^:+^ . )+o^ '^+^ 



Vj + vj+i 2{Vj + Vj+i ) °'\2{Vj+ Vj+i )) \Vj+ v,+i 

or equivalently 



Vj+i \2{Vj+Vj+i)^ 

= log(VS- + v,+i) - log v,+i + 0(1) 
= logy,-logz;j+i+0(l), 
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and therefore 

^ + logwj+i = logFj + 0(1). 

Vj+l 

It follows that, as j tends to infinity, 



^j+i 



By ([5]), we obtain 



y^+i - ^7 = "".7+1 ^ 



2 log 2 

2 log 2 



J 



logj 
and therefore, summing all these estimates yields 

y, <21og2^-— ^21og2 . 
^logfc log J 

from which the upper bound of Theorem [2] follows. 

5. Heuristics 

In this section, we discuss the quality of the bound in Theorem [1] In investi- 
gations on intersecting codes, which are equivalent to determining D2(C2), Cohen 
and Lempel [5^ put forward the heuristic that one can expect that the rate of an 
intersecting code will not exceed the Gilbert- Varshamov bound 

l-h{S). 

Extrapolating this heuristic to the investigation of Dj{C2), which might be too op- 
timistic as the restrictions imposed on the code corresponding to extremal example 
associated to Dj{C2) get weaker as j increases, this suggests to use the function 
1 — h{S) as if it were an upper-bounding function. 

An argument similar to the one in the proof of Theorem [2] thus yields an opti- 
mistic heuristic bound Bj < log 2 7^. 

t^J r^ t> log J 

Regarding heuristic numerical values we get, for example, 1.294 for j = 2, 1.550 
for j = 3, 1.784 for j = 4, 2.003 for j = 5, and 2.984 for j = 10. 
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